hERG classification model based on a combination of support vector machine method and GRIND descriptors.

نویسندگان

  • Qiyuan Li
  • Flemming Steen Jørgensen
  • Tudor Oprea
  • Søren Brunak
  • Olivier Taboureau
چکیده

The human Ether-a-go-go Related Gene (hERG) potassium channel is one of the major critical factors associated with QT interval prolongation and development of arrhythmia called Torsades de Pointes (TdP). It has become a growing concern of both regulatory agencies and pharmaceutical industries who invest substantial effort in the assessment of cardiac toxicity of drugs. The development of in silico tools to filter out potential hERG channel inhibitors in early stages of the drug discovery process is of considerable interest. Here, we describe binary classification models based on a large and diverse library of 495 compounds. The models combine pharmacophore-based GRIND descriptors with a support vector machine (SVM) classifier in order to discriminate between hERG blockers and nonblockers. Our models were applied at different thresholds from 1 to 40 microm and achieved an overall accuracy up to 94% with a Matthews coefficient correlation (MCC) of 0.86 ( F-measure of 0.90 for blockers and 0.95 for nonblockers). The model at a 40 microm threshold showed the best performance and was validated internally (MCC of 0.40 and F-measure of 0.57 for blockers and 0.81 for nonblockers, using a leave-one-out cross-validation). On an external set of 66 compounds, 72% of the set was correctly predicted ( F-measure of 0.86 and 0.34 for blockers and nonblockers, respectively). Finally, the model was also tested on a large set of hERG bioassay data recently made publicly available on PubChem ( http://pubchem.ncbi.nlm.nih.gov/assay/assay.cgi?aid=376) to achieve about 73% accuracy ( F-measure of 0.30 and 0.83 for blockers and nonblockers, respectively). Even if there is still some limitation in the assessment of hERG blockers, the performance of our model shows an improvement between 10% and 20% in the prediction of blockers compared to other methods, which can be useful in the filtering of potential hERG channel inhibitors.

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Fault diagnosis in a distillation column using a support vector machine based classifier

Fault diagnosis has always been an essential aspect of control system design. This is necessary due to the growing demand for increased performance and safety of industrial systems is discussed. Support vector machine classifier is a new technique based on statistical learning theory and is designed to reduce structural bias. Support vector machine classification in many applications in v...

متن کامل

Acoustic detection of apple mealiness based on support vector machine

Mealiness degrades the quality of apples and plays an important role in fruit market. Therefore, the use of reliable and rapid sensing techniques for nondestructive measurement and sorting of fruits is necessary. In this study, the potential of acoustic signals of rolling apples on an inclined plate as a new technique for nondestructive detection of Red Delicious apple mealiness was investigate...

متن کامل

A new classification method based on pairwise SVM for facial age estimation

This paper presents a practical algorithm for facial age estimation from frontal face image. Facial age estimation generally comprises two key steps including age image representation and age estimation. The anthropometric model used in this study includes computation of eighteen craniofacial ratios and a new accurate skin wrinkles analysis in the first step and a pairwise binary support vector...

متن کامل

Heart Rate Variability Classification using Support Vector Machine and Genetic Algorithm

Background: Electrocardiogram (ECG) is defined as an electrical signal, which represents cardiac activity. Heart rate variability (HRV) as the variation of interval between two consecutive heartbeats represents the balance between the sympathetic and parasympathetic branches of the autonomic nervous system.Objective: In this study, we aimed to evaluate the efficiency of discrete wavelet transfo...

متن کامل

Automatic Interpretation of UltraCam Imagery by Combination of Support Vector Machine and Knowledge-based Systems

With the development of digital sensors, an increasing number of high-resolution images are available. Interpretation of these images is not possible manually, which necessitates seeking for practical, fast and automatic solutions to solve the environmental and location-based management problems. The land cover classification using high-resolution imagery is a difficult process because of the c...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

عنوان ژورنال:
  • Molecular pharmaceutics

دوره 5 1  شماره 

صفحات  -

تاریخ انتشار 2008